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Abstract. 

We analyze the prospect for observing the intermediate neutral Higgs boson (/12) in its decay 
to two lighter Higgs bosons (hi) at the presently operating hadron colliders in the framework of 
the CP violating MSSM using the PYTHIA event generator. We consider the lepton+ 4-jets+ 
Ifir channel from associate Wfi2 production, with Whi — > Whihi — > Ivtbbbb. We require two or 
three tagged fe-jets. We explicitly consider all relevant Standard Model backgrounds, treating 
c-jets separately from light flavor and gluon jets and allowing for mistagging. We find that 
it is very hard to observe this signature at the Tevatron, even with 20 fb _1 of data, in the 
LEP-allowed region of parameter space due to the small signal efficiency. At the LHC, a priori 
huge SM backgrounds can be suppressed by applying judiciously chosen kinematical selections. 
After all cuts, we are left with a signal cross section of around 0.5 fb, and a signal to background 
ratio between 1.2 and 2.9. According to our analysis this Higgs signal should be viable at the 
LHC in the vicinity of present LEP exclusion once 20 to 50 fb -1 of data have been accumulated 
at y/s = 14 TeV. 



1. Introduction 

The Minimal Supersymmetric Standard Model (MSSM) pQ requires two Higgs doublets, leading 
to a total of five physical Higgs bosons. At the tree level, these can be classified as 
two neutral CP-even bosons ((pi and <p2 ), one neutral CP-odd boson (a) and two charged 
bosons. In the presence of CP violation, the three neutral Higgs bosons can mix radiatively 
[2J |3]. The mass eigenstates hi, hi and /13 with m/ il < < can then be obtained 

from the interaction eigenstates (pi, (p2 and a with the help of the orthogonal matrix O a i, 
((pi, 4>2: a)a = O a i(hi, hi, h%)J , which diagonalizes the Higgs boson mass matrix. O depends on 
various parameters of the SUSY Lagrangian. 

Due to this mixing, the Higgs mass eigenstates are no longer CP eigenstates. Moreover, 
the masses of the Higgs bosons, their couplings to SM and MSSM particles, and their 
decays are significantly modified [3j. For example, the Higgs boson couplings to pairs of 
gauge bosons are scaled by ghiVV relative to the SM. These couplings can be expressed as 
9hiW = cos ft Ofai + sin/3 Ofj, 2 i , where tan (3 is the ratio of Higgs vacuum expectation values 
(VEVs). The magnitude of gh 2 ww is directly related to the production process studied in this 
paper. 

In the absence of mixing between neutral CP-even and CP-odd states the LEP experiments 
were able to derive absolute lower bounds of about 90 GeV on the masses of both the lighter 



CP-even Higgs and the CP-odd boson [3]. However, in the presence of CP violation, the LEP 
experiment were not able to exclude certain scenarios with very light h\. In this "LEP hole" h\ 
is dominantly a CP-odd state with almost vanishing coupling to the Z boson. One then has to 
search for Zh 2 or hih 2 production. In part of the LEP hole, these cross sections are suppressed 
by the rather large h 2 mass. Moreover, h 2 — > h\h\ decays lead to quite complicated final states, 
which often yield low efficiencies after cuts. One LEP-allowed region has m/^ less than 10 GeV, 
so that hi —> t + t~ is dominant; in the other, m/ ll ~ 30 — 50 GeV so that hi — > bb is dominant. 
77J/J2 lies between slightly below 90 and slightly above 130 GeV. Scenarios with even lighter hi 
are excluded by decay-independent searches for Zh 2 production [3J. If is much above 130 
GeV, the CP-odd component of hi becomes subdominant, so that the cross section for Zhi 
production becomes too large. Finally, the LEP hole occurs for tan/3 in between 3 and 10 [3J. 
We analyze the prospect for observing a signal for the production of neutral Higgs bosons in the 
second of these LEP allowed regions. 

2. Numerical analysis 

In our analysis we took five different benchmark points, denoted by SI through S5 [mh 2 =130-90 
GeV with m hl =30 GeV] and two CPX . 5 -scenario [CPX-1(2) where m h2 =W2 (103) GeV with 
m hl =36 (45) GeV, which can be realized for M H + = 131.8 GeV with tan/3 = 4.02 (4.39)] of 
the MSSM [3]. We calculated the spectrum and the couplings for these two benchmark points 
using CPsuperH [3J. In our simulation we used the PYTHIA v6.408 [B] event generator with 
the SLHA [7] interfacing option. We used MadGraph/MadEvent v4 . 2 . 8 [5] for generating parton 
level SM backgrounds which were fed to PYTHIA for showering. We set the renormalization and 
factorization scale to Q = ^/I and used CTEQ5L for the parton distribution functions (PDF). 

The signal arises from pp —> Wh 2 — > tvihihi — > ivibbbb, leading to Zjjjjtfjr events, where 
I = e or [i. The effective cross section for this signal topology can be expressed as, 

^sfgnai = <tsm(pP/pP -> Wh 2 ) x g 2 h2WW x Br(h 2 -> hihi)x Br (hi -> bb) 2 x 2Br(W -> ev e ) , (1) 

where gh 2 ww ls the h 2 WW coupling in units of the corresponding SM value, W stands for 
and the factor 2 is for t = e and \i. This process has recently been studied in refs.[9l [TO], using 
parton-level analyses with quite promising results. We instead performed a full hadron-level 
analysis, including initial and final state showering as well as the underlying event. We take 
9h 2 ww x Brfa -»■ hhi) x Br{hi -»■ bb) 2 = 0.50. 

We simulate our signal and backgrounds at Tevatron Run-II with y/s = 1.96 TeV. We have 
used the toy calorimeter simulation (PYCELL) provided in PYTHIA with the following criteria: 
calorimeter coverage is \r}\ < 3.64; the segmentation is given by Ar/ x A</>=0.16 x 0.098 which 
resembles the CDF detector [TT|; Gaussian smearing of the total energy of jets and leptons; a 
cone algorithm with AR(j,j) = \J Arj 2 + Ac/) 2 = 0.4 has been used for jet finding; Efj? 1 ^^ > 1.5 

GeV is considered to be a potential candidate for jet initiator; minimum summed E^ min > 10.0 
GeV is accepted as a jet and the jets are ordered in Et; leptons (Z = e, fj) are selected with 
E^ > 15.0 GeV and |?/| < 2.0 and no jet should match with a hard lepton in the event. We 
reconstructed the missing energy (pr) from all observed particles. 

The tagging of b— jets plays a crucial role in our analysis. A jet with \rjj\ < 1.2 and Eh > 15 
GeV "matched" with a b— flavored hadron (B— hadron), i.e. with AR(j,B — hadron) < 0.2, is 
considered to be "taggable". We assume that such jets are actually tagged with with probability 
efr = 0.50 [12]. We find that our tagging algorithm agrees well with the tt analysis of CDF |13j . 
We also modeled mistagging of non— b jets as b— jets, treating c— jets differently from those due 
to gluons or light quarks with e c = 0.10 and e u ,d,s,g = 0.01. 

The cross-sections for the signal benchmarks lie in between 0.045 — 0.16 pb while the total 
background is approximately 455 pb. We have displayed the raw number of events for signal 
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Figure 1. The four-jet invariant mass re- 
distribution after all cuts for signal scenarios 
SI, S3 and S5 and for the total background 
(ToB), requiring triple b— tag following the 
last column of Table [1] at Tevatron. 




m 4j (GeV) 

Figure 2. The four-jet invariant mass re- 
distribution after all cuts for signal scenarios 
SI, S3 and S5 and for the total background 
(ToB), requiring triple b— tag following the 
last column of Table [1] at LHC. 



and backgrounds in the first column in Table [TJ The second column is the number of events 
after applying the following basic acceptance cuts: iVj et > 4, EjT" > 10 GeV, |r/- ?=1_4 |<3.0; 
-^lepton > Ij-^f > 15 GeV, |?/|<2.0 and > 15 GeV. Not surprisingly, ti is the main source of 
background at this stage. We found that the suppressions by applying Nb-tag > 2 are almost the 
same for signal and ti. The signal contains more b quarks, but the it background has much harder 
b jets, leading to larger tagging probabilities. However, this background can contain a third b 
jet only due to showering. Hence requiring N^-tag > 3 greatly reduces the background and 
increases the signal-to-background ratio. Unfortunately the small triple b— tagging probability, 
which lies between 1.3 and 3.6% depending on m/j 2 , also reduces the signal rate. 

We have also looked for invariant mass peaks to isolate the signal on top of a sizable 
background. In addition to the basic acceptance cuts (defined above) we demand that the signal 
contains exactly (rather than at least) four jets. This reduces combinatorial backgrounds for 
Higgs mass reconstructions. Finally, we pick the jet pairing (ij)(kl) (with i,j,k,l £ {1,2,3,4}) 
that minimizes the difference rrt^-. —mj k j l of di-jet invariant masses; in the absence of showering 
and for perfect energy resolution, the signal would have rn^j . = rrij k j l = ■ We then demand 
that both rrij^. and frij k j l lie between 10 and 60 GeV, where the lower bound comes from the 
requirement that h\ —¥ bb decays should be allowed, and the upper bound from the requirement 
that /12 — > h\h\ decays should be open. We next require the four-jet invariant mass to lie 
between 60 and 140 GeV; this covers the entire "LEP hole" in the MSSM Higgs parameter 
space. Table Q] shows that after these cuts, the signal actually exceeds the total background. 

The results of the column labeled Eff3 in Table Q] have been obtained by including all events 
that pass the other cuts (not related to tagging) and have at least three taggable jets, assigning 
each event a weight given by its (mis)tagging probability. This greatly increases the statistics. 
We checked that this gives results that are consistent with the event rejection technique whenever 
the latter has good statistics; this is the case if at most one b— tag results from mistagging. 

The distribution of m^j is shown in Fig. [T]for signal scenarios SI, S3 and S5 as well as for the 



Table 1. Process column shows the signal benchmarks and the SM backgrounds, where j stands 
for u,d,s,g and - for background process 7 and 8 - c. RawEvt is stands for the number of events 
produced in the experiments (for backgrounds we applied basic pre-selections in the generator 
level: p J ^ b > 5 GeV, 1 77 Jr ' fo | <5.0 and AR(jj,bb,bj) > 0.3). N acc is the number after the basic 
selection cuts, whereas N^b is with at least three jets tagged as b— jets, allowing for mistagging. 
Eff3 is the number of events passing the selection cuts that contain exactly four jets and at least 
36— tagged jets; the numbers in parentheses represent the number of events with the inclusion 
of 

m pa.ir and m^j cuts. Finally, ToB is the total number of background events. 
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total background. We observe clear peaks for the signal, which are shifted downwards due to, 
for example, showering (by 10 to 15 GeV) from the naive expectation = mh 2 at the parton 
level. 

By seeing the distributions of m pa [ T (average of the two optimal pairing of di-jet invariant 
masses) and m^j (see Fig. [1]) allow us define the final significance of the signal by counting 
events that satisfy: 0.6m/ ll < m pa i r < m/ ll + 5 GeV and 0.7mh 2 < m^j < rrih 2 + 10 GeV. We 
see that requiring triple b— tags leads to very good signal to background ratio, of around 10 for 
= 30 GeV and slightly less for heavier h\. However, we expect less than 2 signal events 
after all cuts for a total integrated luminosity (/ Cdt) of 20 fb _1 . We tried also to see the effect 
for two b— tag and found that the signal rate increase by a factor between 3.7 and 5 while the 
background increases by two orders of magnitude and S/ VB is well below two. 

The significance defined in this way overestimates the true statistical significance of a double 
peak in the m pa i r and m^j distributions somewhat, due to the "look elsewhere" effect: since m/ ll 
and 771/J2 are not known a priori, one would need to try different combinations when looking for 
peaks. However, given that we use rather broad search windows, there are probably only O(10) 
statistically independent combinations within the limits of the LEP hole. Note also that the 



signal rate is still quite small. Further kinematical cuts, which might slightly increase the signal 
to background ratio, are therefore not likely to increase the statistical significance of the signal. 
We are therefore forced to conclude that the search for Whi —¥ Wh\h\ — > lubbbb events at the 
Tevatron does not seem promising, and turn instead to the LHC. 

Our analysis for the LHC follows broadly similar lines as that for the Tevatron. We simulate 
our signal and backgrounds at the LHC with yfs = 14 TeV. The PYCELL model is based on the 
ATLAS detector [14] with calorimeter coverage \r)\ < 5.0, segmentation Ar] x A(p = 0.087 x 0.10. 
We use a Gaussian energy resolution for leptons and jets and a cone algorithm for jet finding, 
with AR(j,j) = 0.4. Calorimeter cells with E| cl ^ in > 1.0 GeV are considered to be a potential 
candidates for jet initiator. All cells with E^^m > 0.1 GeV are treated as part of the would-be 
jet; minimum summed E ^,^ > 15.0 GeV is accepted as a jet and the jets are ordered in Et] 

leptons (£ = e, fj) are selected if they satisfy E^ > 20 GeV and \rf \ < 2.5. The jet-lepton 
isolation criterion and the missing transverse energy ftp are adopted similar to our Tevatron 
analysis. 

Only jets with \rjj\ < 2.5 are considered to be taggable as b— jets. If the jet is "matched" 
to a b— flavored hadron, with AR(j, hadron) < 0.2, the tagging efficiency is taken to be 50%. 
If instead the jet is matched to a c— hadron, the (mis)tagging efficiency is taken to be 10%, 
whereas jets matched to a r— lepton have zero tagging probability. All other taggable jets have 
(mis)tagging probability of 0.25%. These efficiencies follow recent ATLAS analyses |15|. I16| . fTT] . 

The cross-sections for the signal benchmarks lie between 0.5 — 1.7 pb while the total 
backgrounds is approximately 2.84 • 10 4 pb. We have displayed the raw number of events 
for signal and backgrounds (again with the same generator-level cuts as of Tevatron) in the 
first column in Table [TJ The second column is the number of events after applying the basic 
acceptance cuts: A jet > 4, Elf 1 ' 4 > 15 GeV, |r? i=1 - 4 | <5.0; A lepton >1,E^> 20 GeV \rf\<2.5 
and f)r > 20 GeV. They reduce the cross section by about a factor of 5 (3) for nih 2 = 90 (130) 
GeV. 

The number of events (for / £dt=10 fb _1 ) passing the acceptance cuts and containing exactly 
four jets, at least three of which are tagged (adopting the same strategy like Tevatron), is given 
by the Eff3 column of Table [TJ Similar to Tevatron we require both jet pair invariant masses 
to lie between 10 and 60 GeV and m^j to lie between 60 and 140 GeV. The m^j distribution is 
shown in Fig. [21 After these cuts we are left with slightly less than one signal event and slightly 
less than two background events per fb _1 of data. A 5<r signal would then require almost 100 
fb _1 of data, more than the LHC is likely to collect during "low" luminosity running. 

We also checked that requiring a fourth 6— tag reduces the signal cross section by another 
order of magnitude or more. The signal rate then becomes so low that one would have to wait 
for the high-luminosity phase of the LHC to accumulate enough events to reconstruct invariant 
mass peaks. We therefore stick to triple b— tag in our LHC analysis. 

We found that the background shows a peak in the m pa i r distribution between 30 and 40 
GeV, not far from the peak of the signal in the scenarios we consider. A tighter cut on m pa i r 
will nevertheless improve the signal-to-background ratio. Moreover, the four-jet invariant mass 
distribution (in Fig. [2]) of the background peaks at large values, largely due to the contribution 
from ti production. At least for scenarios with hi masses in the lower half of the "LEP hole" 
region a tighter cut on m$j will therefore also improve the significance of the signal. We therefore 
applied the same "double-peak" cuts as at the Tevatron. The signal then always exceeds the 
background. Assuming an integrated luminosity of 60 fb _1 at the end of "low" luminosity 
running, we find a final statistical significance of at least 5 standard deviations, and a signal 
sample of some 30 events. 



3. Conclusions 

We analyzed the possibility of observing neutral Higgs bosons at currently operating hadron 
colliders in the framework of the CP violating MSSM. We explored the tjjjjftr channel with 
double, triple and quadruple b tag, focusing on the region of parameter space not excluded by 
LEP searches. We considered a large number of SM backgrounds and employed a full hadron- 
level Monte Carlo simulation using the PYTHIA event generator. We carefully implemented 
b— tagging, including mistagging of c— jets or light flavor or gluon jets. At the Tevatron, requiring 
36— tag, we can only expect about one signal event per 10 fb _1 of J Cdt, on a background of 
about 0.3 events. If we require only 2b— tag, the signal increases by a factor of about 4, but 
the background increases by two orders of magnitude, making the signal unobservable. At the 
LHC we focussed on events with exactly four jets, cutting simultaneously on the average di-jet 
invariant mass and the four-jet invariant mass and demanding at least 36-tags. We found a 
signal rate above the background, and a signal significance exceeding 5a for an J Cdt of 60 fb -1 . 
One might be able to increase the S:B ratio even more by requiring 4b— tags with softer tagging 
criteria (enhancing mistag rate also), possibly simultaneously relaxing the requirement on the 
number of jets. This could be used to confirm the existence of a signal. 

We conclude that searches for Wh<i production at the LHC with W —> Iv and hi —}h\h\—t 
bbbb should be able to close that part of the "LEP hole" in parameter space where h\ —> bb 
decays dominate. The details of this analysis can be found in our recent paper [18] . 
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